Overview

Dataset statistics

Number of variables: 24
Number of observations: 118277
Missing cells: 325
Missing cells (%): < 0.1%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 76.3 MiB
Average record size in memory: 676.3 B
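
The missing-cell total can be cross-checked against the per-variable "Missing" counts listed in the Variables section. The sketch below is only a consistency check on the reported figures; the dictionary is transcribed by hand from this report, and the assumption that "Missing cells (%)" is missing cells divided by variables times observations is mine.

```python
# Per-variable missing counts, transcribed from the Variables section.
# Sixteen columns report 20 missing values; ratio_deaths reports 5.
columns_with_20_missing = [
    "IdBundesland", "Bundesland", "Landkreis", "Altersgruppe", "Geschlecht",
    "AnzahlFall", "AnzahlTodesfall", "ObjectId", "Meldedatum", "IdLandkreis",
    "Datenstand", "NeuerFall", "NeuerTodesfall", "Refdatum", "NeuGenesen",
    "AnzahlGenesen",
]
missing_per_column = {name: 20 for name in columns_with_20_missing}
missing_per_column["ratio_deaths"] = 5

n_variables = 24
n_observations = 118277

missing_cells = sum(missing_per_column.values())
total_cells = n_variables * n_observations  # assumed denominator
missing_pct = 100 * missing_cells / total_cells

print(missing_cells)
print(f"{missing_pct:.4f}%")
```

The sum reproduces the 325 missing cells shown above, and the resulting share is well below the "< 0.1%" threshold.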

Variable types

NUM: 11
CAT: 10
DATE: 3

Reproduction

Analysis started: 2020-04-26 17:55:51.820968
Analysis finished: 2020-04-26 18:00:56.451218
Version: pandas-profiling v2.6.0
Command line: pandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configuration: config.yaml

Warnings

Landkreis has a high cardinality: 412 distinct values (High cardinality)
IdLandkreis has a high cardinality: 412 distinct values (High cardinality)
ObjectId is highly correlated with IdBundesland (High correlation)
IdBundesland is highly correlated with ObjectId (High correlation)
NeuerTodesfall is highly correlated with AnzahlTodesfall (High correlation)
AnzahlTodesfall is highly correlated with NeuerTodesfall (High correlation)
Cases deaths is highly correlated with Cases confirmed and 3 other fields (High correlation)
Cases confirmed is highly correlated with Cases deaths and 3 other fields (High correlation)
Cases Recovered is highly correlated with Cases confirmed and 3 other fields (High correlation)
Cases non-lethal is highly correlated with Cases confirmed and 3 other fields (High correlation)
ratio_deaths is highly correlated with Cases confirmed and 3 other fields (High correlation)
AnzahlTodesfall has 113000 (95.5%) zeros (Zeros)
AnzahlGenesen has 30067 (25.4%) zeros (Zeros)
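
The zero percentages in these warnings can be reproduced from the counts. The sketch below suggests that the share is taken over all 118277 observations rather than over the 118257 non-missing values; that reading is inferred from the rounding, not stated by the report, and `zero_share` is a hypothetical helper name.

```python
n_observations = 118277  # from the dataset statistics
n_missing = 20           # missing values reported for both variables


def zero_share(zero_count, denominator):
    """Percentage of zeros, rounded to one decimal as in the report."""
    return round(100 * zero_count / denominator, 1)


# Share over all rows, including rows where the value is missing.
print(zero_share(113000, n_observations))              # AnzahlTodesfall
print(zero_share(30067, n_observations))               # AnzahlGenesen

# Share over non-missing rows only, for comparison.
print(zero_share(113000, n_observations - n_missing))
```

For AnzahlTodesfall only the first denominator yields the reported 95.5%; for AnzahlGenesen both denominators round to 25.4%.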

Variables

IdBundesland
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count: 16
Unique (%): < 0.1%
Missing: 20
Missing (%): < 0.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 7.504147746010807
Minimum: 1.0
Maximum: 16.0
Zeros: 0
Zeros (%): 0.0%
Memory size: 1.8 MiB

Quantile statistics

Minimum: 1
5-th percentile: 3
Q1: 5
Median: 8
Q3: 9
95-th percentile: 14
Maximum: 16
Range: 15
Interquartile range (IQR): 4

Descriptive statistics

Standard deviation: 3.021735819
Coefficient of variation (CV): 0.4026754165
Kurtosis: 0.4148267835
Mean: 7.504147746
Median Absolute Deviation (MAD): 2.370551448
Skewness: 0.3574097156
Sum: 887418
Variance: 9.130887359
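
Several of the statistics above are simple functions of one another, so they can be checked for internal consistency. The sketch below uses only the IdBundesland values from this report and the standard definitions (range = max - min, IQR = Q3 - Q1, CV = standard deviation / mean, variance = standard deviation squared); the variable names are illustrative.

```python
# Values transcribed from the IdBundesland block of this report.
minimum, maximum = 1, 16
q1, q3 = 5, 9
mean = 7.504147746
std = 3.021735819

value_range = maximum - minimum   # reported as 15
iqr = q3 - q1                     # reported as 4
cv = std / mean                   # reported as 0.4026754165
variance = std ** 2               # reported as 9.130887359

print(value_range, iqr)
print(f"{cv:.10f}")
print(f"{variance:.9f}")
```

Small differences in the last digits are expected because the inputs are themselves rounded.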
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
9 29166 24.7%
 
5 24535 20.7%
 
8 22756 19.2%
 
3 7947 6.7%
 
6 6905 5.8%
 
7 5094 4.3%
 
11 5016 4.2%
 
14 3491 3.0%
 
1 2380 2.0%
 
12 2370 2.0%
 
Other values (6) 8597 7.3%
 
ValueCountFrequency (%) 
1 2380 2.0%
 
2 2272 1.9%
 
3 7947 6.7%
 
4 569 0.5%
 
5 24535 20.7%
 
ValueCountFrequency (%) 
16 1953 1.7%
 
15 1290 1.1%
 
14 3491 3.0%
 
13 618 0.5%
 
12 2370 2.0%
 

Bundesland
Categorical

Distinct count: 16
Unique (%): < 0.1%
Missing: 20
Missing (%): < 0.1%
Memory size: 6.8 MiB
Bayern
29166
Nordrhein-Westfalen
24535
Baden-Württemberg
22756
Niedersachsen
7947
Hessen
6905
Other values (11)
26948
ValueCountFrequency (%) 
Bayern 29166 24.7%
 
Nordrhein-Westfalen 24535 20.7%
 
Baden-Württemberg 22756 19.2%
 
Niedersachsen 7947 6.7%
 
Hessen 6905 5.8%
 
Rheinland-Pfalz 5094 4.3%
 
Berlin 5016 4.2%
 
Sachsen 3491 3.0%
 
Schleswig-Holstein 2380 2.0%
 
Brandenburg 2370 2.0%
 
Other values (6) 8597 7.3%
 

Length

Max length: 22
Mean length: 12.31328153
Min length: 3
ValueCountFrequency (%) 
Lowercase_Letter 23 65.7%
 
Uppercase_Letter 11 31.4%
 
Dash_Punctuation 1 2.9%
 
ValueCountFrequency (%) 
Latin 34 97.1%
 
Common 1 2.9%
 
ValueCountFrequency (%) 
ASCII 34 100.0%
 

Landkreis
Categorical

HIGH CARDINALITY
Distinct count: 412
Unique (%): 0.3%
Missing: 20
Missing (%): < 0.1%
Memory size: 6.8 MiB
SK München
 
2438
SK Hamburg
 
2272
StadtRegion Aachen
 
1369
LK Heinsberg
 
1358
LK Esslingen
 
1246
Other values (407)
109574
ValueCountFrequency (%) 
SK München 2438 2.1%
 
SK Hamburg 2272 1.9%
 
StadtRegion Aachen 1369 1.2%
 
LK Heinsberg 1358 1.1%
 
LK Esslingen 1246 1.1%
 
Region Hannover 1217 1.0%
 
LK Ludwigsburg 1205 1.0%
 
SK Köln 1065 0.9%
 
SK Frankfurt am Main 976 0.8%
 
SK Stuttgart 922 0.8%
 
Other values (402) 104189 88.1%
 

Length

Max length: 36
Mean length: 14.26399046
Min length: 3
ValueCountFrequency (%) 
Lowercase_Letter 28 50.0%
 
Uppercase_Letter 23 41.1%
 
Open_Punctuation 1 1.8%
 
Dash_Punctuation 1 1.8%
 
Other_Punctuation 1 1.8%
 
Close_Punctuation 1 1.8%
 
Space_Separator 1 1.8%
 
ValueCountFrequency (%) 
Latin 51 91.1%
 
Common 5 8.9%
 
ValueCountFrequency (%) 
ASCII 52 100.0%
 

Altersgruppe
Categorical

Distinct count: 7
Unique (%): < 0.1%
Missing: 20
Missing (%): < 0.1%
Memory size: 6.8 MiB
A35-A59
47862
A15-A34
28966
A60-A79
24580
A80+
12608
A05-A14
 
2848
Other values (2)
 
1393
ValueCountFrequency (%) 
A35-A59 47862 40.5%
 
A15-A34 28966 24.5%
 
A60-A79 24580 20.8%
 
A80+ 12608 10.7%
 
A05-A14 2848 2.4%
 
A00-A04 1211 1.0%
 
unbekannt 182 0.2%
 
(Missing) 20 < 0.1%
 

Length

Max length: 9
Mean length: 6.682609468
Min length: 3
ValueCountFrequency (%) 
Decimal_Number 9 47.4%
 
Lowercase_Letter 7 36.8%
 
Uppercase_Letter 1 5.3%
 
Math_Symbol 1 5.3%
 
Dash_Punctuation 1 5.3%
 
ValueCountFrequency (%) 
Common 11 57.9%
 
Latin 8 42.1%
 
ValueCountFrequency (%) 
ASCII 19 100.0%
 

Geschlecht
Categorical

Distinct count: 3
Unique (%): < 0.1%
Missing: 20
Missing (%): < 0.1%
Memory size: 6.8 MiB
W
60677
M
57183
unbekannt
 
397
ValueCountFrequency (%) 
W 60677 51.3%
 
M 57183 48.3%
 
unbekannt 397 0.3%
 
(Missing) 20 < 0.1%
 

Length

Max length: 9
Mean length: 1.027190409
Min length: 1
ValueCountFrequency (%) 
Lowercase_Letter 7 77.8%
 
Uppercase_Letter 2 22.2%
 
ValueCountFrequency (%) 
Latin 9 100.0%
 
ValueCountFrequency (%) 
ASCII 9 100.0%
 

AnzahlFall
Real number (ℝ)

Distinct count: 40
Unique (%): < 0.1%
Missing: 20
Missing (%): < 0.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.303373161842428
Minimum: -1.0
Maximum: 49.0
Zeros: 0
Zeros (%): 0.0%
Memory size: 1.8 MiB

Quantile statistics

Minimum: -1
5-th percentile: 1
Q1: 1
Median: 1
Q3: 1
95-th percentile: 3
Maximum: 49
Range: 50
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 1.184558027
Coefficient of variation (CV): 0.9088402784
Kurtosis: 201.6147141
Mean: 1.303373162
Median Absolute Deviation (MAD): 0.514612983
Skewness: 10.779774
Sum: 154133
Variance: 1.40317772
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1 99981 84.5%
 
2 11837 10.0%
 
3 3113 2.6%
 
4 1263 1.1%
 
5 636 0.5%
 
6 413 0.3%
 
7 264 0.2%
 
8 175 0.1%
 
9 124 0.1%
 
10 87 0.1%
 
Other values (30) 364 0.3%
 
ValueCountFrequency (%) 
-1 42 < 0.1%
 
1 99981 84.5%
 
2 11837 10.0%
 
3 3113 2.6%
 
4 1263 1.1%
 
ValueCountFrequency (%) 
49 1 < 0.1%
 
42 2 < 0.1%
 
38 2 < 0.1%
 
37 1 < 0.1%
 
36 2 < 0.1%
 

AnzahlTodesfall
Real number (ℝ)

HIGH CORRELATION
ZEROS
Distinct count: 9
Unique (%): < 0.1%
Missing: 20
Missing (%): < 0.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.04766736852786727
Minimum: -1.0
Maximum: 9.0
Zeros: 113000
Zeros (%): 95.5%
Memory size: 1.8 MiB

Quantile statistics

Minimum: -1
5-th percentile: 0
Q1: 0
Median: 0
Q3: 0
95-th percentile: 0
Maximum: 9
Range: 10
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 0.2342722029
Coefficient of variation (CV): 4.914729093
Kurtosis: 72.40976486
Mean: 0.04766736853
Median Absolute Deviation (MAD): 0.09114987943
Skewness: 6.434891355
Sum: 5637
Variance: 0.05488346506
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0 113000 95.5%
 
1 4972 4.2%
 
2 218 0.2%
 
3 40 < 0.1%
 
4 14 < 0.1%
 
5 7 < 0.1%
 
-1 3 < 0.1%
 
6 2 < 0.1%
 
9 1 < 0.1%
 
(Missing) 20 < 0.1%
 
ValueCountFrequency (%) 
-1 3 < 0.1%
 
0 113000 95.5%
 
1 4972 4.2%
 
2 218 0.2%
 
3 40 < 0.1%
 
ValueCountFrequency (%) 
9 1 < 0.1%
 
6 2 < 0.1%
 
5 7 < 0.1%
 
4 14 < 0.1%
 
3 40 < 0.1%
 

ObjectId
Real number (ℝ≥0)

HIGH CORRELATION
UNIFORM
Distinct count: 118257
Unique (%): 100.0%
Missing: 20
Missing (%): < 0.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 3825102.0
Minimum: 3765974.0
Maximum: 3884230.0
Zeros: 0
Zeros (%): 0.0%
Memory size: 6.8 MiB

Quantile statistics

Minimum: 3765974
5-th percentile: 3771886.8
Q1: 3795538
Median: 3825102
Q3: 3854666
95-th percentile: 3878317.2
Maximum: 3884230
Range: 118256
Interquartile range (IQR): 59128

Descriptive statistics

Standard deviation: 34137.99973
Coefficient of variation (CV): 0.008924729257
Kurtosis: -1.2
Mean: 3825102
Median Absolute Deviation (MAD): 29564.25
Skewness: -8.502173037e-18
Sum: 4.523450872e+11
Variance: 1165403026
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
3783526 1 < 0.1%
 
3798724 1 < 0.1%
 
3854759 1 < 0.1%
 
3867524 1 < 0.1%
 
3804047 1 < 0.1%
 
3783530 1 < 0.1%
 
3867521 1 < 0.1%
 
3791963 1 < 0.1%
 
3854755 1 < 0.1%
 
3798778 1 < 0.1%
 
Other values (118247) 118247 > 99.9%
 
(Missing) 20 < 0.1%
 
ValueCountFrequency (%) 
3765974 1 < 0.1%
 
3765975 1 < 0.1%
 
3765976 1 < 0.1%
 
3765977 1 < 0.1%
 
3765978 1 < 0.1%
 
ValueCountFrequency (%) 
3884230 1 < 0.1%
 
3884229 1 < 0.1%
 
3884228 1 < 0.1%
 
3884227 1 < 0.1%
 
3884226 1 < 0.1%
 

Meldedatum
Date

Distinct count: 75
Unique (%): 0.1%
Missing: 20
Missing (%): < 0.1%
Memory size: 6.8 MiB
Minimum: 2020-01-28 00:00:00
Maximum: 2020-04-25 00:00:00
Histogram

IdLandkreis
Categorical

HIGH CARDINALITY
Distinct count: 412
Unique (%): 0.3%
Missing: 20
Missing (%): < 0.1%
Memory size: 6.8 MiB
09162
 
2438
02000
 
2272
05334
 
1369
05370
 
1358
08116
 
1246
Other values (407)
109574
ValueCountFrequency (%) 
09162 2438 2.1%
 
02000 2272 1.9%
 
05334 1369 1.2%
 
05370 1358 1.1%
 
08116 1246 1.1%
 
03241 1217 1.0%
 
08118 1205 1.0%
 
05315 1065 0.9%
 
06412 976 0.8%
 
08111 922 0.8%
 
Other values (402) 104189 88.1%
 

Length

Max length: 5
Mean length: 4.999661811
Min length: 3
ValueCountFrequency (%) 
Decimal_Number 10 83.3%
 
Lowercase_Letter 2 16.7%
 
ValueCountFrequency (%) 
Common 10 83.3%
 
Latin 2 16.7%
 
ValueCountFrequency (%) 
ASCII 12 100.0%
 

Datenstand
Categorical

Distinct count: 1
Unique (%): < 0.1%
Missing: 20
Missing (%): < 0.1%
Memory size: 6.8 MiB
26.04.2020, 00:00 Uhr
118257
ValueCountFrequency (%) 
26.04.2020, 00:00 Uhr 118257 > 99.9%
 
(Missing) 20 < 0.1%
 

Length

Max length: 21
Mean length: 20.9969563
Min length: 3
ValueCountFrequency (%) 
Decimal_Number 4 30.8%
 
Lowercase_Letter 4 30.8%
 
Other_Punctuation 3 23.1%
 
Uppercase_Letter 1 7.7%
 
Space_Separator 1 7.7%
 
ValueCountFrequency (%) 
Common 8 61.5%
 
Latin 5 38.5%
 
ValueCountFrequency (%) 
ASCII 13 100.0%
 

NeuerFall
Categorical

Distinct count: 3
Unique (%): < 0.1%
Missing: 20
Missing (%): < 0.1%
Memory size: 6.8 MiB
0
116901
1
 
1314
-1
 
42
ValueCountFrequency (%) 
0 116901 98.8%
 
1 1314 1.1%
 
-1 42 < 0.1%
 
(Missing) 20 < 0.1%
 

Length

Max length: 4
Mean length: 3.000355099
Min length: 3
ValueCountFrequency (%) 
Lowercase_Letter 2 33.3%
 
Decimal_Number 2 33.3%
 
Other_Punctuation 1 16.7%
 
Dash_Punctuation 1 16.7%
 
ValueCountFrequency (%) 
Common 4 66.7%
 
Latin 2 33.3%
 
ValueCountFrequency (%) 
ASCII 6 100.0%
 

NeuerTodesfall
Categorical

HIGH CORRELATION
Distinct count: 4
Unique (%): < 0.1%
Missing: 20
Missing (%): < 0.1%
Memory size: 6.8 MiB
-9
113000
0
 
5114
1
 
140
-1
 
3
ValueCountFrequency (%) 
-9 113000 95.5%
 
0 5114 4.3%
 
1 140 0.1%
 
-1 3 < 0.1%
 
(Missing) 20 < 0.1%
 

Length

Max length: 4
Mean length: 3.955409758
Min length: 3
ValueCountFrequency (%) 
Decimal_Number 3 42.9%
 
Lowercase_Letter 2 28.6%
 
Other_Punctuation 1 14.3%
 
Dash_Punctuation 1 14.3%
 
ValueCountFrequency (%) 
Common 5 71.4%
 
Latin 2 28.6%
 
ValueCountFrequency (%) 
ASCII 7 100.0%
 

Refdatum
Date

Distinct count: 97
Unique (%): 0.1%
Missing: 20
Missing (%): < 0.1%
Memory size: 6.8 MiB
Minimum: 2020-01-01 00:00:00
Maximum: 2020-04-25 00:00:00
Histogram

NeuGenesen
Categorical

Distinct count: 4
Unique (%): < 0.1%
Missing: 20
Missing (%): < 0.1%
Memory size: 6.8 MiB
0
86232
-9
30067
1
 
1868
-1
 
90
ValueCountFrequency (%) 
0 86232 72.9%
 
-9 30067 25.4%
 
1 1868 1.6%
 
-1 90 0.1%
 
(Missing) 20 < 0.1%
 

Length

Max length: 4
Mean length: 3.254969267
Min length: 3
ValueCountFrequency (%) 
Decimal_Number 3 42.9%
 
Lowercase_Letter 2 28.6%
 
Other_Punctuation 1 14.3%
 
Dash_Punctuation 1 14.3%
 
ValueCountFrequency (%) 
Common 5 71.4%
 
Latin 2 28.6%
 
ValueCountFrequency (%) 
ASCII 7 100.0%
 

AnzahlGenesen
Real number (ℝ)

ZEROS
Distinct count: 39
Unique (%): < 0.1%
Missing: 20
Missing (%): < 0.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.9464809694140728
Minimum: -1.0
Maximum: 49.0
Zeros: 30067
Zeros (%): 25.4%
Memory size: 6.8 MiB

Quantile statistics

Minimum: -1
5-th percentile: 0
Q1: 0
Median: 1
Q3: 1
95-th percentile: 2
Maximum: 49
Range: 50
Interquartile range (IQR): 1

Descriptive statistics

Standard deviation: 1.094460745
Coefficient of variation (CV): 1.156347333
Kurtosis: 214.4165819
Mean: 0.9464809694
Median Absolute Deviation (MAD): 0.4842508536
Skewness: 9.783965035
Sum: 111928
Variance: 1.197844321
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1 75043 63.4%
 
0 30067 25.4%
 
2 8899 7.5%
 
3 2190 1.9%
 
4 796 0.7%
 
5 387 0.3%
 
6 233 0.2%
 
7 157 0.1%
 
8 101 0.1%
 
-1 90 0.1%
 
Other values (29) 294 0.2%
 
ValueCountFrequency (%) 
-1 90 0.1%
 
0 30067 25.4%
 
1 75043 63.4%
 
2 8899 7.5%
 
3 2190 1.9%
 
ValueCountFrequency (%) 
49 1 < 0.1%
 
42 2 < 0.1%
 
37 1 < 0.1%
 
36 2 < 0.1%
 
33 1 < 0.1%
 

Country/Region
Categorical

CONSTANT
REJECTED
Distinct count: 1
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 6.8 MiB
Germany
118277
ValueCountFrequency (%) 
Germany 118277 100.0%
 

Length

Max length: 7
Mean length: 7
Min length: 7
ValueCountFrequency (%) 
Lowercase_Letter 6 85.7%
 
Uppercase_Letter 1 14.3%
 
ValueCountFrequency (%) 
Latin 7 100.0%
 
ValueCountFrequency (%) 
ASCII 7 100.0%
 

Date
Date

Distinct count: 95
Unique (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 6.8 MiB
Minimum: 2020-01-22 00:00:00
Maximum: 2020-04-25 00:00:00
Histogram

Cases confirmed
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count: 71
Unique (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 77907.81599127472
Minimum: 0
Maximum: 156513
Zeros: 5
Zeros (%): < 0.1%
Memory size: 6.8 MiB

Quantile statistics

Minimum: 0
5-th percentile: 5795
Q1: 37323
Median: 77872
Q3: 118181
95-th percentile: 148291
Maximum: 156513
Range: 156513
Interquartile range (IQR): 80858

Descriptive statistics

Standard deviation: 46190.21631
Coefficient of variation (CV): 0.5928829569
Kurtosis: -1.24053309
Mean: 77907.81599
Median Absolute Deviation (MAD): 40097.44855
Skewness: -0.03242792417
Sum: 9214702752
Variance: 2133536083
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.000000e+00 1.500000e+01 1.650000e+01 3.650000e+01 6.350000e+01 ... 1.461245e+05 1.476780e+05 1.494695e+05 1.557560e+05 1.565130e+05], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
84794 4927 4.2%
 
77872 4813 4.1%
 
91159 4657 3.9%
 
71808 4599 3.9%
 
50871 4489 3.8%
 
43938 4417 3.7%
 
37323 4193 3.5%
 
113296 4066 3.4%
 
107663 4043 3.4%
 
118181 3777 3.2%
 
Other values (61) 74296 62.8%
 
ValueCountFrequency (%) 
0 5 < 0.1%
 
1 1 < 0.1%
 
4 5 < 0.1%
 
5 4 < 0.1%
 
8 1 < 0.1%
 
ValueCountFrequency (%) 
156513 453 0.4%
 
154999 1228 1.0%
 
153129 1517 1.3%
 
150648 1806 1.5%
 
148291 1718 1.5%
 

Cases deaths
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count: 47
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1558.427352739755
Minimum: 0
Maximum: 5877
Zeros: 948
Zeros (%): 0.8%
Memory size: 6.8 MiB

Quantile statistics

Minimum: 0
5-th percentile: 11
Q1: 206
Median: 920
Q3: 2607
95-th percentile: 5033
Maximum: 5877
Range: 5877
Interquartile range (IQR): 2401

Descriptive statistics

Standard deviation: 1630.516699
Coefficient of variation (CV): 1.046257752
Kurtosis: -0.07903768726
Mean: 1558.427353
Median Absolute Deviation (MAD): 1354.132757
Skewness: 1.021408187
Sum: 184326112
Variance: 2658584.707
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.0000e+00 1.0000e+00 5.0000e+00 8.0000e+00 1.0000e+01 ... 4.5225e+03 4.9475e+03 5.1560e+03 5.6675e+03 5.8770e+03], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
1107 4927 4.2%
 
920 4813 4.1%
 
1275 4657 3.9%
 
775 4599 3.9%
 
342 4489 3.8%
 
267 4417 3.7%
 
206 4193 3.5%
 
2349 4066 3.4%
 
2016 4043 3.4%
 
2607 3777 3.2%
 
Other values (37) 74296 62.8%
 
ValueCountFrequency (%) 
0 948 0.8%
 
2 823 0.7%
 
3 1470 1.2%
 
7 1125 1.0%
 
9 936 0.8%
 
ValueCountFrequency (%) 
5877 453 0.4%
 
5760 1228 1.0%
 
5575 1517 1.3%
 
5279 1806 1.5%
 
5033 1718 1.5%
 

Cases Recovered
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count: 47
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 29806.39945213355
Minimum: 0.0
Maximum: 109800.0
Zeros: 32
Zeros (%): < 0.1%
Memory size: 6.8 MiB

Quantile statistics

Minimum: 0
5-th percentile: 46
Q1: 3547
Median: 18700
Q3: 52407
95-th percentile: 95200
Maximum: 109800
Range: 109800
Interquartile range (IQR): 48860

Descriptive statistics

Standard deviation: 31300.13795
Coefficient of variation (CV): 1.050114691
Kurtosis: -0.2505260738
Mean: 29806.39945
Median Absolute Deviation (MAD): 26017.80681
Skewness: 0.9688453174
Sum: 3525411508
Variance: 979698635.5
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.0000e+00 5.0000e-01 1.3000e+01 1.5500e+01 1.6500e+01 ... 8.6700e+04 9.3350e+04 1.0135e+05 1.0655e+05 1.0980e+05], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
28700 4987 4.2%
 
22440 4927 4.2%
 
18700 4813 4.1%
 
24575 4657 3.9%
 
266 4638 3.9%
 
16100 4599 3.9%
 
6658 4489 3.8%
 
5673 4417 3.7%
 
3547 4193 3.5%
 
46300 4066 3.4%
 
Other values (37) 72491 61.3%
 
ValueCountFrequency (%) 
0 32 < 0.1%
 
1 5 < 0.1%
 
12 3 < 0.1%
 
14 9 < 0.1%
 
15 8 < 0.1%
 
ValueCountFrequency (%) 
109800 1681 1.4%
 
103300 1517 1.3%
 
99400 1806 1.5%
 
95200 1718 1.5%
 
91500 1296 1.1%
 

Cases active
Real number (ℝ≥0)

Distinct count: 72
Unique (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 46542.98918640141
Minimum: 0.0
Maximum: 72864.0
Zeros: 5
Zeros (%): < 0.1%
Memory size: 6.8 MiB

Quantile statistics

Minimum: 0
5-th percentile: 5738
Q1: 33570
Median: 52740
Q3: 63167
95-th percentile: 69566
Maximum: 72864
Range: 72864
Interquartile range (IQR): 29597

Descriptive statistics

Standard deviation: 19951.3078
Coefficient of variation (CV): 0.4286640833
Kurtosis: -0.5251794794
Mean: 46542.98919
Median Absolute Deviation (MAD): 16716.08885
Skewness: -0.78442387
Sum: 5504965132
Variance: 398054682.8
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.00000e+00 4.50000e+00 1.10000e+01 1.25000e+01 4.75000e+01 ... 6.54000e+04 6.89070e+04 6.97025e+04 7.13515e+04 7.28640e+04], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
61247 4927 4.2%
 
58252 4813 4.1%
 
65309 4657 3.9%
 
54933 4599 3.9%
 
43871 4489 3.8%
 
37998 4417 3.7%
 
33570 4193 3.5%
 
64647 4066 3.4%
 
69566 4043 3.4%
 
63167 3777 3.2%
 
Other values (62) 74296 62.8%
 
ValueCountFrequency (%) 
0 5 < 0.1%
 
1 1 < 0.1%
 
2 7 < 0.1%
 
3 2 < 0.1%
 
4 8 < 0.1%
 
ValueCountFrequency (%) 
72864 2937 2.5%
 
69839 2050 1.7%
 
69566 4043 3.4%
 
68248 3396 2.9%
 
65491 2609 2.2%
 

Cases non-lethal
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count: 71
Unique (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 76349.38863853496
Minimum: 0
Maximum: 150636
Zeros: 5
Zeros (%): < 0.1%
Memory size: 6.8 MiB

Quantile statistics

Minimum: 0
5-th percentile: 5784
Q1: 37117
Median: 76952
Q3: 115574
95-th percentile: 143258
Maximum: 150636
Range: 150636
Interquartile range (IQR): 78457

Descriptive statistics

Standard deviation: 44689.58856
Coefficient of variation (CV): 0.5853300119
Kurtosis: -1.248316835
Mean: 76349.38864
Median Absolute Deviation (MAD): 38858.85926
Skewness: -0.06769923816
Sum: 9030376640
Variance: 1997159326
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.000000e+00 1.500000e+01 1.650000e+01 3.650000e+01 6.350000e+01 ... 1.397405e+05 1.414005e+05 1.443135e+05 1.499375e+05 1.506360e+05], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
83687 4927 4.2%
 
76952 4813 4.1%
 
89884 4657 3.9%
 
71033 4599 3.9%
 
50529 4489 3.8%
 
43671 4417 3.7%
 
37117 4193 3.5%
 
110947 4066 3.4%
 
105647 4043 3.4%
 
115574 3777 3.2%
 
Other values (61) 74296 62.8%
 
ValueCountFrequency (%) 
0 5 < 0.1%
 
1 1 < 0.1%
 
4 5 < 0.1%
 
5 4 < 0.1%
 
8 1 < 0.1%
 
ValueCountFrequency (%) 
150636 453 0.4%
 
149239 1228 1.0%
 
147554 1517 1.3%
 
145369 1806 1.5%
 
143258 1718 1.5%
 

ratio_deaths
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count: 49
Unique (%): < 0.1%
Missing: 5
Missing (%): < 0.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.4140712299647697
Minimum: 0.0
Maximum: 3.754959651913899
Zeros: 943
Zeros (%): 0.8%
Memory size: 6.8 MiB

Quantile statistics

Minimum: 0
5-th percentile: 0.1962922574
Q1: 0.551938483
Median: 1.18142593
Q3: 2.190412143
95-th percentile: 3.394002333
Maximum: 3.754959652
Range: 3.754959652
Interquartile range (IQR): 1.63847366

Descriptive statistics

Standard deviation: 1.020140528
Coefficient of variation (CV): 0.7214208917
Kurtosis: -0.7230149558
Mean: 1.41407123
Median Absolute Deviation (MAD): 0.8613582077
Skewness: 0.6289221265
Sum: 167245.0325
Variance: 1.040686696
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1.3055169 4927 4.2%
 
1.18142593 4813 4.1%
 
1.398655097 4657 3.9%
 
1.079266934 4599 3.9%
 
0.6722887303 4489 3.8%
 
0.6076744504 4417 3.7%
 
0.551938483 4193 3.5%
 
2.073330038 4066 3.4%
 
1.87250959 4043 3.4%
 
2.205938349 3777 3.2%
 
Other values (39) 74291 62.8%
 
ValueCountFrequency (%) 
0 943 0.8%
 
0.1372683596 520 0.4%
 
0.1443695861 813 0.7%
 
0.1572327044 657 0.6%
 
0.1700680272 303 0.3%
 
ValueCountFrequency (%) 
3.754959652 453 0.4%
 
3.716153007 1228 1.0%
 
3.640721222 1517 1.3%
 
3.50419521 1806 1.5%
 
3.394002333 1718 1.5%
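
The report does not say how Cases active, Cases non-lethal and ratio_deaths were computed. The figures shown are consistent with the derivation sketched below (non-lethal = confirmed - deaths, active = confirmed - deaths - recovered, ratio_deaths = deaths as a percentage of confirmed). This is an inference from the reported numbers, not a documented definition, and `derive_case_columns` is a hypothetical helper.

```python
import math


def derive_case_columns(confirmed, deaths, recovered):
    """Assumed derivation of the three computed case columns."""
    non_lethal = confirmed - deaths
    active = confirmed - deaths - recovered
    # Zero confirmed cases give an undefined ratio (NaN), which would explain
    # why ratio_deaths has 5 missing values while Cases confirmed has 5 zeros.
    ratio_deaths = 100 * deaths / confirmed if confirmed else math.nan
    return {
        "Cases non-lethal": non_lethal,
        "Cases active": active,
        "ratio_deaths": ratio_deaths,
    }


# Values taken from a sample row dated 2020-03-14 in this report.
row = derive_case_columns(confirmed=4585, deaths=9, recovered=46)
print(row)
```

Applied to the 2020-03-14 sample row this yields 4576 non-lethal and 4530 active cases and a ratio of about 0.196292, matching the values in the report.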
 

Interactions

Correlations

Pearson's r

Pearson's correlation coefficient (r) measures the linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, so for a linear relationship the steepness of the line (its angle to the x-axis) does not affect the magnitude of r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
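
A minimal sketch of this calculation in plain Python follows. It uses population (divide-by-n) covariance and standard deviations; the choice of n or n-1 cancels in the ratio. The function name `pearson_r` and the sample data are illustrative and not taken from the profiled dataset.

```python
import math


def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / n
    std_x = math.sqrt(sum((a - mean_x) ** 2 for a in x) / n)
    std_y = math.sqrt(sum((b - mean_y) ** 2 for b in y) / n)
    return cov / (std_x * std_y)


x = [1, 2, 3, 4]
y = [2, 4, 6, 8]
print(pearson_r(x, y))                        # perfectly linear, increasing
print(pearson_r(x, list(reversed(y))))        # perfectly linear, decreasing
print(pearson_r([10 * a + 3 for a in x], y))  # shifted and rescaled x
```

The third call illustrates the invariance under changes in location and scale: shifting and rescaling x leaves r unchanged.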

Spearman's ρ

Spearman's rank correlation coefficient (ρ) measures the monotonic correlation between two variables and is therefore better than Pearson's r at capturing nonlinear monotonic relationships. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
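
As a rough illustration, ρ can be computed by ranking both variables and applying the Pearson formula to the ranks. The sketch below assigns tied values their average rank, which is a common convention but an assumption here; `average_ranks` and `spearman_rho` are illustrative names.

```python
import math


def average_ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to values[order[i]].
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks


def pearson_r(x, y):
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / n
    std_x = math.sqrt(sum((a - mean_x) ** 2 for a in x) / n)
    std_y = math.sqrt(sum((b - mean_y) ** 2 for b in y) / n)
    return cov / (std_x * std_y)


def spearman_rho(x, y):
    """Pearson correlation of the rank variables of x and y."""
    return pearson_r(average_ranks(x), average_ranks(y))


# A monotonic but nonlinear relationship (y = x squared).
x = [1, 2, 3, 4, 5]
y = [1, 4, 9, 16, 25]
print(spearman_rho(x, y))
print(pearson_r(x, y))
```

On this example ρ equals 1 because the ordering is preserved exactly, while Pearson's r stays below 1 because the relationship is not linear.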

Kendall's τ

Like Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is the difference between the number of concordant pairs and the number of discordant pairs, divided by the total number of pairs.
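
The pair-counting definition can be sketched directly, as below. This is the simplest form of the coefficient (often called tau-a), in which tied pairs count as neither concordant nor discordant; tie-corrected variants exist and the report does not state which one was used. The name `kendall_tau_a` is illustrative.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    concordant = 0
    discordant = 0
    pairs = list(combinations(range(len(x)), 2))
    for i, j in pairs:
        # Positive product: both variables move in the same direction.
        direction = (x[i] - x[j]) * (y[i] - y[j])
        if direction > 0:
            concordant += 1
        elif direction < 0:
            discordant += 1
    return (concordant - discordant) / len(pairs)


print(kendall_tau_a([1, 2, 3, 4], [1, 2, 3, 4]))  # all pairs concordant
print(kendall_tau_a([1, 2, 3, 4], [4, 3, 2, 1]))  # all pairs discordant
print(kendall_tau_a([1, 2, 3], [1, 3, 2]))        # two concordant, one discordant
```

This brute-force version examines every pair, so it is quadratic in the number of observations and suited only to small illustrative inputs.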

Missing values

Sample

First rows

index | IdBundesland | Bundesland | Landkreis | Altersgruppe | Geschlecht | AnzahlFall | AnzahlTodesfall | ObjectId | Meldedatum | IdLandkreis | Datenstand | NeuerFall | NeuerTodesfall | Refdatum | NeuGenesen | AnzahlGenesen | Country/Region | Date | Cases confirmed | Cases deaths | Cases Recovered | Cases active | Cases non-lethal | ratio_deaths
0 | 1.0 | Schleswig-Holstein | SK Flensburg | A15-A34 | M | 1.0 | 0.0 | 3765974.0 | 2020-03-14 | 01001 | 26.04.2020, 00:00 Uhr | 0.0 | -9.0 | 2020-03-16 | 0.0 | 1.0 | Germany | 2020-03-14 | 4585 | 9 | 46.0 | 4530.0 | 4576 | 0.196292
1 | 1.0 | Schleswig-Holstein | SK Flensburg | A15-A34 | W | 1.0 | 0.0 | 3765983.0 | 2020-03-14 | 01001 | 26.04.2020, 00:00 Uhr | 0.0 | -9.0 | 2020-03-12 | 0.0 | 1.0 | Germany | 2020-03-14 | 4585 | 9 | 46.0 | 4530.0 | 4576 | 0.196292
2 | 1.0 | Schleswig-Holstein | SK Flensburg | A35-A59 | M | 1.0 | 0.0 | 3765987.0 | 2020-03-14 | 01001 | 26.04.2020, 00:00 Uhr | 0.0 | -9.0 | 2020-03-16 | 0.0 | 1.0 | Germany | 2020-03-14 | 4585 | 9 | 46.0 | 4530.0 | 4576 | 0.196292
3 | 1.0 | Schleswig-Holstein | SK Flensburg | A35-A59 | W | 1.0 | 0.0 | 3765989.0 | 2020-03-14 | 01001 | 26.04.2020, 00:00 Uhr | 0.0 | -9.0 | 2020-03-10 | 0.0 | 1.0 | Germany | 2020-03-14 | 4585 | 9 | 46.0 | 4530.0 | 4576 | 0.196292
4 | 1.0 | Schleswig-Holstein | SK Kiel | A15-A34 | M | 1.0 | 0.0 | 3766013.0 | 2020-03-14 | 01002 | 26.04.2020, 00:00 Uhr | 0.0 | -9.0 | 2020-03-12 | 0.0 | 1.0 | Germany | 2020-03-14 | 4585 | 9 | 46.0 | 4530.0 | 4576 | 0.196292
5 | 1.0 | Schleswig-Holstein | LK Pinneberg | A15-A34 | M | 1.0 | 0.0 | 3766879.0 | 2020-03-14 | 01056 | 26.04.2020, 00:00 Uhr | 0.0 | -9.0 | 2020-03-12 | 0.0 | 1.0 | Germany | 2020-03-14 | 4585 | 9 | 46.0 | 4530.0 | 4576 | 0.196292
6 | 1.0 | Schleswig-Holstein | LK Pinneberg | A15-A34 | W | 1.0 | 0.0 | 3766932.0 | 2020-03-14 | 01056 | 26.04.2020, 00:00 Uhr | 0.0 | -9.0 | 2020-03-13 | 0.0 | 1.0 | Germany | 2020-03-14 | 4585 | 9 | 46.0 | 4530.0 | 4576 | 0.196292
7 | 1.0 | Schleswig-Holstein | LK Pinneberg | A35-A59 | M | 2.0 | 0.0 | 3766981.0 | 2020-03-14 | 01056 | 26.04.2020, 00:00 Uhr | 0.0 | -9.0 | 2020-03-10 | 0.0 | 2.0 | Germany | 2020-03-14 | 4585 | 9 | 46.0 | 4530.0 | 4576 | 0.196292
8 | 1.0 | Schleswig-Holstein | LK Pinneberg | A35-A59 | M | 1.0 | 0.0 | 3766982.0 | 2020-03-14 | 01056 | 26.04.2020, 00:00 Uhr | 0.0 | -9.0 | 2020-03-12 | 0.0 | 1.0 | Germany | 2020-03-14 | 4585 | 9 | 46.0 | 4530.0 | 4576 | 0.196292
9 | 1.0 | Schleswig-Holstein | LK Pinneberg | A35-A59 | W | 1.0 | 0.0 | 3767071.0 | 2020-03-14 | 01056 | 26.04.2020, 00:00 Uhr | 0.0 | -9.0 | 2020-03-10 | 0.0 | 1.0 | Germany | 2020-03-14 | 4585 | 9 | 46.0 | 4530.0 | 4576 | 0.196292

Last rows

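index | IdBundesland | Bundesland | Landkreis | Altersgruppe | Geschlecht | AnzahlFall | AnzahlTodesfall | ObjectId | Meldedatum | IdLandkreis | Datenstand | NeuerFall | NeuerTodesfall | Refdatum | NeuGenesen | AnzahlGenesen | Country/Region | Date | Cases confirmed | Cases deaths | Cases Recovered | Cases active | Cases non-lethal | ratio_deaths
118267 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaT | NaN | NaN | NaN | NaN | NaT | NaN | NaN | Germany | 2020-02-08 | 13 | 0 | 0.0 | 13.0 | 13 | 0.0
118268 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaT | NaN | NaN | NaN | NaN | NaT | NaN | NaN | Germany | 2020-02-09 | 14 | 0 | 0.0 | 14.0 | 14 | 0.0
118269 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaT | NaN | NaN | NaN | NaN | NaT | NaN | NaN | Germany | 2020-02-10 | 14 | 0 | 0.0 | 14.0 | 14 | 0.0
118270 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaT | NaN | NaN | NaN | NaN | NaT | NaN | NaN | Germany | 2020-02-13 | 16 | 0 | 1.0 | 15.0 | 16 | 0.0
118271 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaT | NaN | NaN | NaN | NaN | NaT | NaN | NaN | Germany | 2020-02-14 | 16 | 0 | 1.0 | 15.0 | 16 | 0.0
118272 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaT | NaN | NaN | NaN | NaN | NaT | NaN | NaN | Germany | 2020-02-15 | 16 | 0 | 1.0 | 15.0 | 16 | 0.0
118273 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaT | NaN | NaN | NaN | NaN | NaT | NaN | NaN | Germany | 2020-02-17 | 16 | 0 | 1.0 | 15.0 | 16 | 0.0
118274 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaT | NaN | NaN | NaN | NaN | NaT | NaN | NaN | Germany | 2020-02-18 | 16 | 0 | 12.0 | 4.0 | 16 | 0.0
118275 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaT | NaN | NaN | NaN | NaN | NaT | NaN | NaN | Germany | 2020-02-19 | 16 | 0 | 12.0 | 4.0 | 16 | 0.0
118276 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaT | NaN | NaN | NaN | NaN | NaT | NaN | NaN | Germany | 2020-02-21 | 16 | 0 | 14.0 | 2.0 | 16 | 0.0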